Click-words: learning to predict document keywords from a user perspective
نویسندگان
چکیده
منابع مشابه
Click-words: learning to predict document keywords from a user perspective
MOTIVATION Recognizing words that are key to a document is important for ranking relevant scientific documents. Traditionally, important words in a document are either nominated subjectively by authors and indexers or selected objectively by some statistical measures. As an alternative, we propose to use documents' words popularity in user queries to identify click-words, a set of prominent wor...
متن کاملExtracting Keywords from Digital Document Collections
An indexing tool was built to provide for one of several information seeking tasks. In ac cordance with the basic principles of work held by the HUMLE laboratory at SICS, a so lution regarding indexing would be a semi-automatic tool. This approach is also relevant as the continuation of the indexing project is conducted in co-operation with the Swedish Parliament, where a staff of professiona...
متن کاملKeyWorld: Extracting Keywords from a Document as a Small World
The small world topology is known widespread in biological, social and man-made systems. This paper shows that the small world structure also exists in documents, such as papers. A document is represented by a network; the nodes represent terms, and the edges represent the co-occurrence of terms. This network is shown to have the characteristics of being small world, i.e., highly clustered and ...
متن کاملLearning to Predict User Operations for Adaptive
Mixed-initiative systems present the challenge of nd-ing an eeective level of interaction between humans and computers. Machine learning presents a promising approach to this problem in the form of systems that automatically adapt their behavior to accommodate diierent users. In this paper, we present an empirical study of learning user models in an adaptive assistant for crisis scheduling. We ...
متن کاملJoint Learning of Chinese Words, Terms and Keywords
Previous work often used a pipelined framework where Chinese word segmentation is followed by term extraction and keyword extraction. Such framework suffers from error propagation and is unable to leverage information in later modules for prior components. In this paper, we propose a four-level Dirichlet Process based model (DP-4) to jointly learn the word distributions from the corpus, domain ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Bioinformatics
سال: 2010
ISSN: 1460-2059,1367-4803
DOI: 10.1093/bioinformatics/btq459